Abstract

We discuss a spin glass reminiscent of the Random Energy Model, which allows us
in particular to recast the Parisi minimization into a more classical Gibbs variational
principle, thereby shedding some light on the physical meaning of the order
parameter of the Parisi theory. As an application, we study the impact of an extensive
cavity field on Derrida's REM: Despite its simplicity, this model displays some
interesting features such as ultrametricity and chaos in temperature.

1 Introduction 

After years of intensive research and important advances ([1], [7], [9], [12]), the Parisi
theory [10], originally developed in the study of the Sherrington-Kirkpatrick model of
spin glasses, still remains mathematically quite elusive.

Despite the spectacular proof by Guerra and Talagrand that Parisi's replica symme- 
try breaking scheme provides the correct free energy for the SK-model, many aspects of 
the Parisi ansatz continue to present major challenges. In fact, the appearance of seem- 
ingly universal features, such as the Derrida-Ruelle hierarchical structures, the (related) 
ultrametricity, the law of the pure states, are still far from being understood. 

We hope to gain some modest insights into these issues by considering generalizations 
of the Random Energy Model (REM for short), that is, models with Hamiltonians given 
by independent random variables. Our generalization is different from the "general- 
ized random energy model" invented by Derrida. It can be analyzed by large deviation 
techniques. Despite its simplicity, it exhibits a number of interesting properties, like 
asymptotic ultrametricity, Poisson-Dirichlet description of the pure states, chaos in tem- 
perature, and a non-trivial dependence of the overlap structure on the temperature. The 
free energy is given by a Parisi-type formula which naturally can be linked to a Gibbs 
variational formula via a kind of duality relation which makes apparent why an infimum 
appears in the Parisi formulation. 

The second part of this work presents a particular mean field spin glass which we
call the "REM+Cavity". It is related to the random overlap structures of Aizenman,
Sims and Starr [1]; but, instead of taking the thermodynamical limit in the REM first
and then performing a one spin perturbation of the Derrida-Ruelle structures, we perform
a cavity field perturbation on the finite systems first, and only subsequently do we take
the thermodynamical limit. As a first step, we stick here to the simplest finite size
counterpart of the Derrida-Ruelle structures, the Random Energy Model [5]. Our model
shows a delicate phase transition where Replica Symmetry is broken and ultrametricity
sets in. In the low temperature region massive pure states emerge, with law being
given by the Poisson-Dirichlet distribution. The model also displays chaotic behavior
in temperature. The natural extensions of our approach to models with more intricate
dependencies than those of REM-type (such as for instance the Generalized Random
Energy Models) turn out to be quite subtle. We will address this issue in a forthcoming
paper.



2 Mean field models of REM-type 

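Consider a double sequence $X_{\alpha,i}$, $\alpha, i \ge 1$, of i.i.d. random variables with a distribution
$\mu$, taking values in a Polish space $(S, \mathcal{S})$, and which are defined on a probability space
$(\Omega, \mathcal{F}, \mathbb{P})$. For $N \in \mathbb{N}$, $\alpha \ge 1$, the empirical distribution is defined by

$$L_{N,\alpha} \stackrel{\text{def}}{=} \frac{1}{N} \sum_{i=1}^{N} \delta_{X_{\alpha,i}},$$

which takes values in $\mathcal{M}_1^+(S)$, the set of probability measures on $(S, \mathcal{S})$, which itself is a
Polish space when equipped with the weak topology. Let $\Phi : \mathcal{M}_1^+(S) \to \mathbb{R}$ be a continuous
function. We write

$$Z_N \stackrel{\text{def}}{=} 2^{-N} \sum_{\alpha=1}^{2^N} \exp\left[N \Phi(L_{N,\alpha})\right], \qquad f_N(\Phi, \mu) \stackrel{\text{def}}{=} \frac{1}{N} \log Z_N.$$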

Theorem 2.1

The limit $f(\Phi, \mu) = \lim_{N\to\infty} f_N(\Phi, \mu)$ exists $\mathbb{P}$-a.s., is non-random, and is given as

$$f(\Phi, \mu) = \sup\left\{\Phi(\nu) - H(\nu \,|\, \mu) : H(\nu \,|\, \mu) \le \log 2\right\},$$

where $H$ is the usual relative entropy $H(\nu \,|\, \mu) = \int \log\frac{d\nu}{d\mu}\, d\nu$ if $\nu \ll \mu$ and $\log(d\nu/d\mu) \in L_1(\nu)$,
and $= \infty$ otherwise.

We specialize to linear functionals $\Phi(\nu) = \int \phi\, d\nu$, $\phi : S \to \mathbb{R}$, i.e.

$$Z_N = 2^{-N} \sum_{\alpha=1}^{2^N} \exp\Big[\sum_{i=1}^{N} \phi(X_{\alpha,i})\Big]. \tag{2.1}$$



In order that $\Phi$ is continuous, we have to assume that $\phi$ is bounded and continuous, a
condition we want to relax somewhat. By a slight abuse of notation, we write $f_N(\phi, \mu)$
for the free energy of the finite-size system, and $f(\phi, \mu)$ for its limit, which by Theorem
2.1 is given through

$$f(\phi, \mu) = \sup\Big\{\int \phi(x)\, \nu(dx) - H(\nu \,|\, \mu) : H(\nu \,|\, \mu) \le \log 2\Big\}, \tag{2.2}$$

at least if $\phi$ is bounded and continuous. We shall refer to expression (2.2) as the Gibbs
variational principle (GVP). Let us write for a distribution $\nu \in \mathcal{M}_1^+(S)$, and $h : S \to \mathbb{R}$,
$E_\nu[h] \stackrel{\text{def}}{=} \int h(x)\, \nu(dx)$, and for $m \in \mathbb{R}$, $\Gamma_\phi(m) \stackrel{\text{def}}{=} \log E_\mu[e^{m\phi}]$, which we always assume
to exist. We also define the probability measure $G_m$ on $S$ by

$$\frac{dG_m}{d\mu} \stackrel{\text{def}}{=} \frac{e^{m\phi}}{Z(m)}, \tag{2.3}$$

where $Z(m)$ is the appropriate norming constant.

Theorem 2.2

Assume $\phi : S \to \mathbb{R}$ is continuous, and satisfies

$$\int e^{\lambda\phi}\, d\mu < \infty \tag{2.4}$$

for all real $\lambda$. Then

$$\lim_{N\to\infty} f_N(\phi, \mu) = f(\phi, \mu), \tag{2.5}$$

$f$ given by (2.2). Furthermore, there exists a unique maximizer of the right hand side of
(2.2) in the form $G_{m_*}$, where $m_* \in (0, 1]$ is characterized as follows: If

$$\Gamma_\phi'(1) - \Gamma_\phi(1) \le \log 2, \tag{2.6}$$

then $m_* = 1$. Otherwise $m_* \in (0, 1)$ is the unique solution to the following equation:

$$m\,\Gamma_\phi'(m) - \Gamma_\phi(m) = \log 2. \tag{2.7}$$

If $m_* = 1$, i.e. (2.6) holds, we say the model is in high temperature, and otherwise in
low temperature.
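The characterization of $m_*$ in Theorem 2.2 lends itself to a simple root search, since $m \mapsto m\Gamma_\phi'(m) - \Gamma_\phi(m)$ vanishes at $m = 0$ and is increasing for $m \ge 0$. The sketch below is only an illustration: the helper `solve_m_star` is hypothetical, and the test case assumes $\mu$ standard normal and $\phi(x) = \beta x$, so that $\Gamma_\phi(m) = \beta^2 m^2/2$ and $m_* = \sqrt{2\log 2}/\beta$ in low temperature.

```python
import math

LOG2 = math.log(2.0)

def solve_m_star(gamma, dgamma, tol=1e-12):
    """Return m_* in (0, 1] given callables for Gamma and its derivative Gamma'."""
    def constraint(m):
        # m * Gamma'(m) - Gamma(m), the left-hand side of (2.7)
        return m * dgamma(m) - gamma(m)

    if constraint(1.0) <= LOG2:
        return 1.0  # high temperature: condition (2.6) holds
    lo, hi = 0.0, 1.0  # constraint(0) = 0 < log 2 < constraint(1)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if constraint(mid) < LOG2:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Illustrative assumption: mu = N(0, 1), phi(x) = beta * x.
beta = 2.0
m_star = solve_m_star(lambda m: 0.5 * (beta * m) ** 2, lambda m: beta ** 2 * m)
print(m_star, math.sqrt(2.0 * LOG2) / beta)
```

Bisection is used here only because the constraint function is monotone; any other one-dimensional root finder would serve equally well.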

For the Sherrington-Kirkpatrick model the free energy was originally obtained by
Parisi using the replica method, and a special ansatz for the so-called "replica symmetry
breaking". The physical content of Parisi's functional is still somewhat mysterious
despite considerable progress made later. In our setting, the nonrigorous
RSB-mechanism would yield the following free energy for a spin glass of the form (2.1):

$$\text{Parisi}(\phi, \mu) \stackrel{\text{def}}{=} \inf_{m \in [0,1]} \Big\{\frac{\log 2}{m} + \frac{1}{m} \log E_\mu\big[e^{m\phi}\big] - \log 2\Big\}. \tag{2.8}$$

The fact that one takes the infimum instead of the usual supremum in the Gibbs formalism
is at first sight rather puzzling. However, in our setup, the identification of (2.2)
with the right-hand side of (2.8) will be rather straightforward, and we have

Theorem 2.3

$$f(\phi, \mu) = \text{Parisi}(\phi, \mu).$$

We learned from Guerra [8] a simple argument showing that $f(\phi, \mu)$ is bounded
by (2.8): For $m \in [0, 1]$,

$$f_N(\phi, \mu) = \frac{1}{N} \log\Big(2^{-N} \sum_\alpha \exp\Big[\sum_i \phi(X_{\alpha,i})\Big]\Big) \le \frac{1}{Nm} \log\Big(2^{-Nm} \sum_\alpha \exp\Big[m \sum_i \phi(X_{\alpha,i})\Big]\Big),$$

where the last bound follows by straightforward convexity/concavity arguments. Taking
expectation w.r.t. the randomness, exploiting concavity of the logarithm, the independence
of the random variables appearing in the sum $\sum_i \phi(X_{\alpha,i})$, and optimizing over
$m \in [0, 1]$ one easily gets

$$E f_N(\phi, \mu) \le \text{Parisi}(\phi, \mu), \tag{2.9}$$

uniformly in $N$. We will not use that, and we will give another proof of Theorem 2.3 in
Section 4.1.
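To make the identity of Theorem 2.3 concrete, one can compare the Parisi infimum with the Gibbs value $\Gamma_\phi'(m_*) - \log 2$ in an explicitly solvable case. The sketch below assumes $\mu$ standard normal and $\phi(x) = \beta x$ with $\beta = 2$ (an illustrative low temperature choice, not taken from the text), and uses a crude grid search with hypothetical helper names.

```python
import math

LOG2 = math.log(2.0)

def parisi_functional(m, gamma):
    """(log 2)/m + Gamma(m)/m - log 2, the expression minimised in (2.8)."""
    return (LOG2 + gamma(m)) / m - LOG2

def parisi_infimum(gamma, grid=100000):
    # Grid search over m in (0, 1]; m = 0 is left out because the value diverges there.
    return min(parisi_functional(k / grid, gamma) for k in range(1, grid + 1))

beta = 2.0  # illustrative assumption: phi(x) = beta * x under mu = N(0, 1)
gamma = lambda m: 0.5 * (beta * m) ** 2
m_star = math.sqrt(2.0 * LOG2) / beta      # solves m Gamma'(m) - Gamma(m) = log 2
gibbs_value = beta ** 2 * m_star - LOG2    # E_{G_{m_*}}(phi) - H(G_{m_*} | mu)
print(parisi_infimum(gamma), gibbs_value)
```

In this example both numbers agree with the closed form $\beta\sqrt{2\log 2} - \log 2$ up to the grid resolution.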

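As usual, the Gibbs measure is defined by

$$\mathcal{G}_{\Phi,N}(\alpha) \stackrel{\text{def}}{=} \frac{2^{-N} \exp\left[N\Phi(L_{N,\alpha})\right]}{Z_N}, \qquad 1 \le \alpha \le 2^N.$$

We analyze this only in the linear case $\Phi(\nu) = \int \phi\, d\nu$. By an abuse of notation, we write
it simply as $\mathcal{G}_{\phi,N}(\alpha)$.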

We recall the definition of the Poisson-Dirichlet point process with parameter $m \in (0, 1)$.
We first consider a Poisson point process on $\mathbb{R}^+$ with intensity measure $t^{-m-1}\,dt$.
We call such a point process a PP($m$). This point process has countably many
points with a maximal element. If we order the points downwards, we obtain a sequence
of random variables $\xi_1 > \xi_2 > \cdots$. If $m < 1$, then $\zeta \stackrel{\text{def}}{=} \sum_i \xi_i < \infty$ almost surely, and
we can define $\eta_i \stackrel{\text{def}}{=} \xi_i / \zeta$. Then $\sum_i \delta_{\eta_i}$ is a Poisson-Dirichlet point process with parameter
$m$. We write PD($m$) for such a point process.
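The construction of PD($m$) from PP($m$) can be simulated approximately by keeping only the largest points. The sketch below relies on the standard inverse-tail representation of a Poisson process (if $T_1 < T_2 < \cdots$ are the arrival times of a rate-one Poisson process, then $(mT_k)^{-1/m}$ are the ordered points of a PP($m$), because the intensity has tail mass $t^{-m}/m$ on $(t, \infty)$). The truncation to finitely many points and the name `sample_pd` are assumptions made for illustration.

```python
import random

def sample_pd(m, n_points, rng):
    """Approximate PD(m) sample: the n_points largest atoms of a PP(m), normalised to sum 1."""
    arrival, points = 0.0, []
    for _ in range(n_points):
        arrival += rng.expovariate(1.0)             # next arrival of a rate-one Poisson process
        points.append((m * arrival) ** (-1.0 / m))  # points come out in decreasing order
    total = sum(points)
    return [x / total for x in points]

rng = random.Random(0)
weights = sample_pd(0.5, 1000, rng)
print(weights[:3], sum(w * w for w in weights))
```

Averaging `sum(w * w for w in weights)` over many independent samples should give a value close to $1 - m$, up to truncation and Monte Carlo error.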

Theorem 2.4

Suppose that $\phi$, $\mu$ are such that the system is in low temperature, i.e. $m_* < 1$ ($m_*$ the
unique solution to the entropy condition (2.7)). Assume furthermore that the distribution
of $\phi$ under $\mu$ is non-lattice. Then the point process $\sum_\alpha \delta_{\mathcal{G}_{\phi,N}(\alpha)}$ converges weakly
as $N \to \infty$ to a PD($m_*$).

Remark that Theorem 2.4 accounts for some universality of the Derrida-Ruelle structures
and the so-called Poisson-Dirichlet distribution, which naturally arise in the weak
limits of the Gibbs measure associated to a REM-system in low temperature.



3 The REM+Cavity model

We give an application of the previous results. Let again $N \in \mathbb{N}$. We set $\Sigma_N \stackrel{\text{def}}{=} \{1, \dots, 2^N\}$
and consider on some probability space $(\Omega, \mathcal{F}, \mathbb{P})$ a sequence $(X_\alpha, \alpha \in \Sigma_N)$
of independent, centered Gaussians with variance $N$, as well as another independent
sequence $(g_{\alpha,i}, \alpha \in \Sigma_N, i = 1, \dots, N)$ of independent standard Gaussians. For $\alpha \in \Sigma_N$,
$\sigma = (\sigma_1, \dots, \sigma_N) \in \{\pm 1\}^N$, we define the Hamiltonian of the REM+Cavity-model:

$$H(\alpha, \sigma) \stackrel{\text{def}}{=} X_\alpha + \sum_{i=1}^{N} g_{\alpha,i}\, \sigma_i. \tag{3.1}$$



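$H(\cdot, \cdot)$ is thus a Gaussian field on $\Sigma_N \times \{\pm 1\}^N$ with covariance given by

$$E\left[H(\alpha, \sigma) H(\alpha', \sigma')\right] = N \delta_{\alpha = \alpha'} + N \delta_{\alpha = \alpha'}\, q(\sigma, \sigma'),$$

where $q(\sigma, \sigma') \stackrel{\text{def}}{=} \frac{1}{N} \sum_{i=1}^{N} \sigma_i \sigma_i'$ is the usual overlap of the configurations $\sigma, \sigma'$. For $\beta \in \mathbb{R}$,
the inverse of the temperature, we define the free energy

$$f_N(\beta) \stackrel{\text{def}}{=} \frac{1}{N} \log\Big[2^{-2N} \sum_{\alpha, \sigma} \exp\left(\beta H(\alpha, \sigma)\right)\Big]. \tag{3.2}$$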

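Proposition 3.1

The limit $f(\beta) = \lim_{N\to\infty} f_N(\beta)$ exists $\mathbb{P}$-a.s. and is given by

$$f(\beta) = \begin{cases} \beta^2 & \text{if } \beta \le \beta_{cr}, \\[4pt] \beta^2 m_* + \dfrac{E\left[\cosh(\beta g)^{m_*} \log\cosh(\beta g)\right]}{E\left[\cosh(\beta g)^{m_*}\right]} - \log 2 & \text{if } \beta > \beta_{cr}, \end{cases} \tag{3.3}$$

where $g$ is a standard Gaussian and $E$ the expectation with respect to it, with $\beta_{cr} > 0$
being the unique positive solution of the equation

$$E\left[\cosh(\beta g) \log\cosh(\beta g)\right] = e^{\beta^2/2} \log 2, \tag{3.4}$$

and for $\beta > \beta_{cr}$, $m_* = m_*(\beta) \in (0, 1)$ is the unique solution of

$$\frac{\beta^2}{2} m^2 + m\, \frac{E\left[\cosh(\beta g)^{m} \log\cosh(\beta g)\right]}{E\left[\cosh(\beta g)^{m}\right]} - \log E\left[\cosh(\beta g)^{m}\right] = \log 2. \tag{3.5}$$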

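The mechanism behind this formula is easy to understand. Remark first that we can
write the partition function as

$$2^{-2N} \sum_{\alpha,\sigma} \exp\left(\beta H(\alpha, \sigma)\right) = 2^{-N} \sum_\alpha e^{\beta X_\alpha}\, 2^{-N} \sum_\sigma \exp\Big[\beta \sum_{i=1}^{N} g_{\alpha,i} \sigma_i\Big]$$
$$= 2^{-N} \sum_\alpha e^{\beta X_\alpha} \prod_{i=1}^{N} \cosh(\beta g_{\alpha,i})$$
$$= 2^{-N} \sum_\alpha \exp\Big[\beta X_\alpha + \sum_{i=1}^{N} \log\cosh(\beta g_{\alpha,i})\Big]$$
$$= 2^{-N} \sum_\alpha \exp\Big[\beta X_\alpha + N \int \log\cosh(\beta x)\, L_{N,\alpha}(dx)\Big],$$

where

$$L_{N,\alpha} \stackrel{\text{def}}{=} \frac{1}{N} \sum_{i=1}^{N} \delta_{g_{\alpha,i}}.$$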
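The key step in the computation above is the trace over the Ising spins, which turns the average over $\sigma \in \{\pm 1\}^N$ into a product of hyperbolic cosines. A brute-force check for a small system is sketched below; the field values are random placeholders and the function names are hypothetical.

```python
import itertools
import math
import random

def spin_average(beta, fields):
    """Brute force: 2^{-N} * sum over sigma in {-1,+1}^N of exp(beta * sum_i g_i sigma_i)."""
    n = len(fields)
    total = 0.0
    for sigma in itertools.product((-1, 1), repeat=n):
        total += math.exp(beta * sum(g * s for g, s in zip(fields, sigma)))
    return total / 2 ** n

def cosh_product(beta, fields):
    """Closed form: product over i of cosh(beta * g_i)."""
    return math.prod(math.cosh(beta * g) for g in fields)

rng = random.Random(0)
fields = [rng.gauss(0.0, 1.0) for _ in range(8)]
print(spin_average(0.7, fields), cosh_product(0.7, fields))
```

The two printed numbers coincide up to floating point rounding, which is the identity used to reduce the REM+Cavity model to a model of REM-type.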

The probability that for a fixed $\alpha$, we have $X_\alpha \approx yN$ and $L_{N,\alpha} \approx \nu$ is

$$\approx \exp\left[-N y^2/2 - N H(\nu \,|\, \mu)\right],$$

$\mu$ being here the standard normal distribution. Arguing roughly in the same way as
before, we conclude that

$$f(\beta) = \sup\Big\{\beta y + \int \log\cosh(\beta x)\, \nu(dx) - y^2/2 - H(\nu \,|\, \mu) : y^2/2 + H(\nu \,|\, \mu) \le \log 2\Big\}, \tag{3.6}$$

which leads to the expression in Proposition 3.1. $\beta_{cr}$ is the value for which the restriction
$y^2/2 + H(\nu \,|\, \mu) \le \log 2$ becomes relevant in the supremum. An interesting feature is
that for any $\beta$, the $\alpha$'s which are giving the main contribution to the partition function
are those for which $L_{N,\alpha} \approx \nu$,
$\nu$ being the maximizer in the variational problem. We will give a precise derivation in
Section 4.3.

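According to the convention following Theorem 2.2 we call the region $\beta \le \beta_{cr}$ the
high-temperature, and $\beta > \beta_{cr}$ the low temperature regime. The associated Gibbs
measure is:

$$\mathcal{G}_{\beta,N}(\alpha, \sigma) \stackrel{\text{def}}{=} \frac{\exp\left[\beta H(\alpha, \sigma)\right]}{\sum_{\alpha', \sigma'} \exp\left[\beta H(\alpha', \sigma')\right]}.$$

It is not difficult to realize that even in low temperature, the Gibbs weights of individual
configurations are exponentially small in $N$. To get a macroscopic weight we must lump
together exponentially many configurations. In the present situation, we have to take
the marginal measure on the first component:

$$\mathcal{G}^{(1)}_{\beta,N}(\alpha) \stackrel{\text{def}}{=} \sum_{\sigma \in \{\pm 1\}^N} \mathcal{G}_{\beta,N}(\alpha, \sigma).$$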

Proposition 3.2

If $\beta > \beta_{cr}$, then the point process $\sum_\alpha \delta_{\mathcal{G}^{(1)}_{\beta,N}(\alpha)}$ converges weakly to PD($m_*$).

We thus witness in the low-temperature regime of the REM+Cavity the emergence 
of massive pure states, with law being given by the Poisson-Dirichlet distribution. 

We can also derive the limiting behavior of the overlaps under the replicated Gibbs
measure $\mathcal{G}^{\otimes 2}_{\beta,N}$. Following the physicists' convention, we write $\langle \cdot \rangle^{\otimes 2}_{\beta,N}$ for the expectation
with respect to $\mathcal{G}^{\otimes 2}_{\beta,N}$. From the above proposition, it is clear that in the $N \to \infty$
limit, $\mathcal{G}^{\otimes 2}_{\beta,N}(\alpha = \alpha')$ has the same distribution as $\sum_i \eta_i^2$, where the $\eta_i$ are the points of
a PD($m_*$). Here $\alpha, \alpha'$ are the first components of the two replicas. The expectation of
$\sum_i \eta_i^2$ is well known to be $1 - m_*$. Therefore, we get

$$\lim_{N\to\infty} E\, \mathcal{G}^{\otimes 2}_{\beta,N}(\alpha = \alpha') = 1 - m_*. \tag{3.7}$$



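Conditioned on $\alpha \ne \alpha'$, the overlap of $\sigma, \sigma'$ is 0 in the $N \to \infty$ limit, whereas conditioned
on $\alpha = \alpha'$, it is given by

$$q_* \stackrel{\text{def}}{=} \frac{E\left[\tanh^2(\beta g) \exp\left(m_* \log\cosh(\beta g)\right)\right]}{E\left[\exp\left(m_* \log\cosh(\beta g)\right)\right]} \tag{3.8}$$

for $g$ a standard Gaussian and $E$ denoting expectation with respect to it. To phrase it
precisely: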



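Proposition 3.3 (Ultrametricity for the REM+Cavity)

For $\beta > \beta_{cr}$,

$$\lim_{N\to\infty} E \left\langle \delta_{\alpha = \alpha'} \left[q(\sigma, \sigma') - q_*\right]^2 \right\rangle^{\otimes 2}_{\beta,N} = 0, \tag{3.9}$$

and

$$\lim_{N\to\infty} E \left\langle \delta_{\alpha \ne \alpha'}\, q(\sigma, \sigma')^2 \right\rangle^{\otimes 2}_{\beta,N} = 0. \tag{3.10}$$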
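The limiting overlap $q_*$ of (3.8) is a ratio of two one-dimensional Gaussian integrals and is easy to evaluate numerically. In the sketch below the trapezoidal quadrature, the function names and the inputs $\beta = 1.5$, $m = 0.8$ are illustrative assumptions; in particular $m$ is passed in by hand rather than computed from (3.5).

```python
import math

def gaussian_expectation(func, half_width=12.0, steps=4800):
    """E[func(g)] for g ~ N(0, 1), by the trapezoidal rule on [-half_width, half_width]."""
    h = 2.0 * half_width / steps
    total = 0.0
    for k in range(steps + 1):
        x = -half_width + k * h
        weight = 0.5 if k in (0, steps) else 1.0
        total += weight * func(x) * math.exp(-0.5 * x * x)
    return total * h / math.sqrt(2.0 * math.pi)

def q_star(beta, m):
    """Ratio E[tanh^2(beta g) cosh(beta g)^m] / E[cosh(beta g)^m], as in (3.8)."""
    num = gaussian_expectation(lambda x: math.tanh(beta * x) ** 2 * math.cosh(beta * x) ** m)
    den = gaussian_expectation(lambda x: math.cosh(beta * x) ** m)
    return num / den

print(q_star(1.5, 0.8))  # beta and m are placeholder inputs
```

The quadrature window is adequate for moderate $\beta$; for very large $\beta$ the hyperbolic cosine would overflow and a rescaled integrand would be needed.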



It should be remarked that our REM+Cavity model is not ultrametrically structured
for finite $N$, in contrast to the pure REM or the GREM. This means that the natural
$L_2$-metric on the Gaussian Hamiltonians is not an ultrametric on $\Sigma_N \times \{-1, 1\}^N$.

An interesting feature of our model is that it exhibits the so-called "chaos in temperature",
in sharp contrast with the pure REM which does not have this property. The
effect is easy to understand. For a temperature parameter $\beta > \beta_{cr}$, $\mathcal{G}_{\beta,N}$ picks from the $\alpha$
for which $X_\alpha \approx y^*_\beta N$, $L_{N,\alpha} \approx \nu^*_\beta$, $(y^*_\beta, \nu^*_\beta)$ being the maximizer of (3.6). $y^*_\beta$, $\nu^*_\beta$ depend in a
non-trivial way on $\beta$. In particular, they change when $\beta$ is changed, regardless of how large
$\beta$ is. Therefore the contribution to the partition function is coming from a completely
different set of $\alpha$'s if one changes the temperature parameter. This is in contrast to
the REM where for $\beta$ above the critical parameter, the $\alpha$'s which contribute are always
those for which the $X_\alpha$ are close to the maximal possible value.
To phrase the property precisely, we have the following result:



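Proposition 3.4 (Chaos in temperature for the REM+Cavity)

Assume $\beta, \beta' > \beta_{cr}$ and $\beta \ne \beta'$. There exists $\delta > 0$ such that

$$\mathbb{P}\left[\mathcal{G}_{\beta,N} \otimes \mathcal{G}_{\beta',N}(\alpha = \alpha') \ge e^{-\delta N}\right] \le e^{-\delta N} \tag{3.11}$$

and

$$\lim_{N\to\infty} E \left\langle \delta_{\alpha \ne \alpha'}\, q(\sigma, \sigma')^2 \right\rangle_{\beta,\beta',N} = 0, \tag{3.12}$$

where $\langle \cdot \rangle_{\beta,\beta',N}$ denotes expectation with respect to $\mathcal{G}_{\beta,N} \otimes \mathcal{G}_{\beta',N}$.

Summarizing, we have the following situation for the $N \to \infty$ limit of the Gibbs measure at
$\beta > \beta_{cr}$: It gives macroscopic weights to $\alpha$'s for which $L_{N,\alpha}$ is approximately $\nu^*_\beta$. The
random weights are given by a Poisson-Dirichlet point process with parameter $m_*$. If
in the replicated system, $\alpha \ne \alpha'$, then the corresponding $\sigma, \sigma'$ have zero overlap with
probability $\approx 1$. On the other hand, if $\alpha = \alpha'$, then the $\sigma, \sigma'$ have a non-zero overlap,
given by $q_*$. If $\beta$ changes, then the choice is made from a completely different group of
$\alpha$'s.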



4 Proofs of the main results 

4.1 The free energy for spin glasses of REM-type 

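For the proof of Theorem 2.1, a technical result is needed. Given $A \subset \mathcal{M}_1^+(S)$ we set

$$M_N(A) \stackrel{\text{def}}{=} \#\{\alpha \le 2^N : L_{N,\alpha} \in A\}.$$

We also write $H(A)$ for $\inf_{\nu \in A} H(\nu \,|\, \mu)$.

Lemma 4.1

Let $\nu \in \mathcal{M}_1^+(S)$, and $V$ be an open neighborhood of $\nu$. If $H(\nu \,|\, \mu) < \log 2$, and $\varepsilon > 0$,
then there exists an open neighborhood $U$ of $\nu$, $U \subset V$, and $\delta > 0$ such that for large
enough $N$

$$\mathbb{P}\left[M_N(U) \le \exp\left[N(\log 2 - H(\nu \,|\, \mu) - \varepsilon)\right]\right] \le e^{-N\delta}, \tag{4.1}$$
$$\mathbb{P}\left[M_N(U) \ge \exp\left[N(\log 2 - H(\nu \,|\, \mu) + \varepsilon)\right]\right] \le e^{-N\delta}. \tag{4.2}$$

If $H(\nu \,|\, \mu) > \log 2$, then there exist $U$ and $\delta$ as above, with

$$\mathbb{P}\left[M_N(U) \ne 0\right] \le e^{-N\delta}.$$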

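Proof. Let first $\nu \in \mathcal{M}_1^+(S)$ satisfy $H(\nu \,|\, \mu) < \log 2$. The statement of the Lemma
is trivial if $\nu = \mu$, so we assume $\nu \ne \mu$. Let $B_r(\nu) \subset \mathcal{M}_1^+(S)$ be the open ball of radius
$r$ and center $\nu$, where we have equipped $\mathcal{M}_1^+(S)$ with one of the standard metrics,
e.g. Prohorov's metric. Then $H(B_r(\nu)) = H(\mathrm{cl}(B_r(\nu)))$, except for countably many
$r$. Therefore we can find arbitrarily small $\varepsilon_1 > 0$, and $U = B_r(\nu) \subset V$, such that
$H(U) = H(\mathrm{cl}(U)) = H(\nu \,|\, \mu) - \varepsilon_1$. From Sanov's Theorem, we have

$$\mathbb{P}(L_{N,\alpha} \in U) \ge \exp\left[-N\left(H(\nu \,|\, \mu) - \tfrac{7}{8}\varepsilon_1\right)\right],$$
$$\mathbb{P}(L_{N,\alpha} \in \mathrm{cl}(U)) \le \exp\left[-N\left(H(\nu \,|\, \mu) - \tfrac{9}{8}\varepsilon_1\right)\right]$$

for large enough $N$. Therefore

$$E M_N(U) \ge \exp\left[N\left(\log 2 - H(\nu \,|\, \mu) + \tfrac{7}{8}\varepsilon_1\right)\right],$$
$$E M_N(U) \le E M_N(\mathrm{cl}(U)) \le \exp\left[N\left(\log 2 - H(\nu \,|\, \mu) + \tfrac{9}{8}\varepsilon_1\right)\right],$$

and using the independence of the $L_{N,\alpha}$,

$$E M_N(U)^2 \le \left(E M_N(U)\right)^2 + \exp\left[N\left(\log 2 - H(\nu \,|\, \mu) + \tfrac{9}{8}\varepsilon_1\right)\right],$$
$$\mathrm{var}\, M_N(U) \le e^{-N\varepsilon_1/2} \left(E M_N(U)\right)^2. \tag{4.3}$$

Hence, by Chebyshev's inequality,

$$\mathbb{P}\left[M_N(U) \le \left(1 - e^{-N\varepsilon_1/8}\right) E M_N(U)\right] \le \exp\left[-N\varepsilon_1/4\right],$$
$$\mathbb{P}\left[M_N(U) \ge \left(1 + e^{-N\varepsilon_1/8}\right) E M_N(U)\right] \le \exp\left[-N\varepsilon_1/4\right].$$

Choosing $\varepsilon_1$ smaller than $\varepsilon/2$ and $\delta = \varepsilon_1/4$ proves the Lemma in this case. The case
$H(\nu \,|\, \mu) > \log 2$ needs only a slight modification. In that case, there exists an open
neighborhood $U \subset V$ of $\nu$ such that $\mathbb{P}(L_{N,\alpha} \in U)$ is exponentially small in $N$, with a
decay rate which is bigger than $\log 2$. This proves the claim by the Markov inequality.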
■ 
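The first moment computation behind Lemma 4.1 can be made fully explicit in a toy case. The sketch below assumes $\mu$ is the fair coin law on $\{0, 1\}$ and takes for $U$ the set of empirical measures with mean at least $a > 1/2$ (a half-line rather than a small ball, chosen only because the count is then an exact binomial sum). In that case $E M_N(U) = \sum_{k \ge aN} \binom{N}{k}$, and its exponential growth rate should approach $\log 2 - H(\nu_a \,|\, \mu)$ with $\nu_a$ the Bernoulli($a$) law. All names are hypothetical.

```python
import math

def log_expected_count_rate(n, a):
    """(1/N) log E[M_N(U)] for the fair coin, U = {empirical mean >= a}, computed exactly."""
    k_min = math.ceil(a * n)
    count = sum(math.comb(n, k) for k in range(k_min, n + 1))
    return math.log(count) / n

def entropy_rate(a):
    """log 2 - H(nu_a | mu) for nu_a = Bernoulli(a) and mu = Bernoulli(1/2)."""
    h = a * math.log(2.0 * a) + (1.0 - a) * math.log(2.0 * (1.0 - a))
    return math.log(2.0) - h

for n in (100, 1000, 4000):
    print(n, log_expected_count_rate(n, 0.7), entropy_rate(0.7))
```

The gap between the two columns shrinks roughly like $\log N / N$, which is the polynomial correction that Sanov's theorem ignores.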

Proof of Theorem 2.1. We first prove the lower bound. Let $\nu$ be any element in
$\mathcal{M}_1^+(S)$ satisfying $H(\nu \,|\, \mu) < \log 2$. Let $\varepsilon > 0$. As $\Phi$ is continuous, we can choose an
open neighborhood $V$ of $\nu$ satisfying $|\Phi(\gamma) - \Phi(\nu)| \le \varepsilon$ for $\gamma \in V$. Applying Lemma 4.1
we find a neighborhood $U$ of $\nu$ in $V$ satisfying (4.1). As

$$Z_N \ge 2^{-N} \exp\left[N \inf_{\gamma \in U} \Phi(\gamma)\right] M_N(U) \ge 2^{-N} e^{N(\Phi(\nu) - \varepsilon)} M_N(U),$$

we get from Lemma 4.1 that $\mathbb{P}$-a.s. one has eventually

$$Z_N \ge 2^{-N} \exp\left[N(\Phi(\nu) - \varepsilon)\right] \exp\left[N(\log 2 - H(\nu \,|\, \mu) - \varepsilon)\right] \ge \exp\left[N\{\Phi(\nu) - H(\nu \,|\, \mu) - 2\varepsilon\}\right],$$

and therefore

$$\liminf_{N\to\infty} \frac{1}{N} \log Z_N \ge \Phi(\nu) - H(\nu \,|\, \mu)$$

almost surely, for all $\nu$ with $H(\nu \,|\, \mu) < \log 2$. This proves the lower bound.

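We now prove the upper bound. We use the well-known fact that there exists a
compact set $K \subset \mathcal{M}_1^+(S)$ such that $\mathbb{P}(L_{N,\alpha} \notin K) \le \exp[-N(\log 2 + 1)]$. Let $D_N$ be the
event

$$D_N = \bigcap_{\alpha=1}^{2^N} \{L_{N,\alpha} \in K\}.$$

Then

$$\mathbb{P}[D_N^c] \le 2^N e^{-N(\log 2 + 1)} = e^{-N},$$

and therefore

$$\mathbb{P}\Big(\liminf_{N\to\infty} D_N\Big) = 1.$$

Fix $\varepsilon > 0$. For any $\nu \in K$, we choose $V_\nu$ such that $|\Phi(\gamma) - \Phi(\nu)| \le \varepsilon$ for $\gamma \in V_\nu$, and then
$U_\nu \subset V_\nu$, according to Lemma 4.1. The $U_\nu$ cover $K$, and we choose a finite subcover, call
it $U_{\nu_1}, \dots, U_{\nu_m}$. Then, on $D \stackrel{\text{def}}{=} \liminf_N D_N$ we have for large enough $N$, writing $U_k$ instead of $U_{\nu_k}$,

$$Z_N = 2^{-N} \sum_\alpha \exp\left[N\Phi(L_{N,\alpha})\right]$$
$$\le 2^{-N} \sum_{k=1}^{m} \sum_{\alpha : L_{N,\alpha} \in U_k} \exp\left[N\Phi(L_{N,\alpha})\right]$$
$$\le 2^{-N} \sum_{k=1}^{m} \exp\left[N\{\Phi(\nu_k) + \varepsilon\}\right] M_N(U_k)$$
$$\le \sum_{k : H(\nu_k | \mu) \le \log 2} \exp\left[N\{\Phi(\nu_k) - H(\nu_k \,|\, \mu) + 2\varepsilon\}\right]$$

outside an event which has probability at most $m \exp[-N \min_{j \le m} \delta_j]$, where the $\delta_j$
corresponds to the $U_j$. From this estimate one gets that $\mathbb{P}$-a.s. one has

$$\limsup_{N\to\infty} \frac{1}{N} \log Z_N \le \sup_{\nu : H(\nu | \mu) \le \log 2} \left[\Phi(\nu) - H(\nu \,|\, \mu)\right],$$

which together with the lower bound settles the proof of Theorem 2.1. ■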

Proof of Theorem 2.2. To prove (2.5), we cannot directly apply Theorem 2.1 unless
$\phi$ is bounded. We therefore truncate $\phi$ by defining $\phi_M(x) \stackrel{\text{def}}{=} \min(M, \max(\phi(x), -M))$,
$M > 0$, which is bounded and continuous. As a consequence of our assumption (2.4),
we have that for any $\varepsilon > 0$, and $K > 0$, we can find $M$ such that

$$\mathbb{P}\Big(\Big|\sum_i \left(\phi(X_{\alpha,i}) - \phi_M(X_{\alpha,i})\right)\Big| \ge \varepsilon N\Big) \le \exp[-KN].$$

If we choose $K > \log 2$, then with probability going to 1 exponentially fast in $N$, there
is no $\alpha$ such that $\left|\sum_i \left(\phi(X_{\alpha,i}) - \phi_M(X_{\alpha,i})\right)\right| \ge \varepsilon N$. In particular

$$\sum_\alpha \exp\Big[\sum_i \phi_M(X_{\alpha,i}) - \varepsilon N\Big] \le \sum_\alpha \exp\Big[\sum_i \phi(X_{\alpha,i})\Big] \le \sum_\alpha \exp\Big[\sum_i \phi_M(X_{\alpha,i}) + \varepsilon N\Big].$$

Applying Theorem 2.1 to $\phi_M$ and passing to the $M \to \infty$ limit in the end proves (2.5).
The more complicated claim is the one on the characterization of the maximizer.
We first restrict the analysis of the GVP (2.2) to measures $G_m$ of the form (2.3).
Restricting the variational formula to these measures evidently yields a lower bound
to the GVP, which actually reads

$$\sup_{m \in \mathbb{R}} \left\{(1 - m)\Gamma_\phi'(m) + \Gamma_\phi(m) : m\Gamma_\phi'(m) - \Gamma_\phi(m) \le \log 2\right\}. \tag{4.4}$$

We now claim that the target function $(1 - m)\Gamma_\phi'(m) + \Gamma_\phi(m)$ is increasing on $m \in
(-\infty, 1]$ and decreasing otherwise; in fact

$$\frac{d}{dm}\left[(1 - m)\Gamma_\phi'(m) + \Gamma_\phi(m)\right] = (1 - m)\Gamma_\phi''(m)$$

and $\Gamma_\phi''(m) \ge 0$, $\forall m \in \mathbb{R}$. Thus, we can restrict the search for a maximizing $m \in \mathbb{R}$ in
(4.4) to:

$$\sup_{m \in (-\infty, 1]} \left\{(1 - m)\Gamma_\phi'(m) + \Gamma_\phi(m) : m\Gamma_\phi'(m) - \Gamma_\phi(m) \le \log 2\right\}. \tag{4.5}$$

But monotonicity also implies that the (global) maximum is attained at $m = 1$, i.e.
equals $\Gamma_\phi(1)$, as long as the side condition $\Gamma_\phi'(1) - \Gamma_\phi(1) \le \log 2$ holds, i.e. we are in the high
temperature region.

In the low temperature region, i.e. if $\Gamma_\phi'(1) - \Gamma_\phi(1) > \log 2$, we first observe that the
function $m \mapsto m\Gamma_\phi'(m) - \Gamma_\phi(m)$ is also increasing, this time for any value of $m \ge 0$:

$$\frac{d}{dm}\left(m\Gamma_\phi'(m) - \Gamma_\phi(m)\right) = m\Gamma_\phi''(m).$$

Hence, monotonicity of both target and constraint function yields that the maximum is
achieved at the largest possible value, which is the one satisfying:

$$m\Gamma_\phi'(m) - \Gamma_\phi(m) = \log 2. \tag{4.6}$$

As the left-hand side is $< \log 2$ at $m = 0$, and $> \log 2$ at $m = 1$, continuity and strict
monotonicity imply that there is a unique $m_* \in (0, 1)$ satisfying this equation.

It remains to prove that any maximizer of (2.2) is $G_{m_*}$. For an arbitrary probability
measure $\nu$ on $S$, we set

$$\psi(\nu) = E_\nu(\phi) - H(\nu \,|\, \mu).$$

We compute the entropy of $\nu$ relative to $G_m$:

$$H(\nu \,|\, G_m) = E_\nu \log\frac{d\nu}{dG_m} = E_\nu \log\Big(\frac{d\nu}{d\mu} \cdot \frac{d\mu}{dG_m}\Big)$$
$$= H(\nu \,|\, \mu) - E_\nu \log\frac{dG_m}{d\mu}$$
$$= H(\nu \,|\, \mu) - m E_\nu \phi + \log Z(m)$$
$$= H(\nu \,|\, \mu) - m E_\nu \phi + m E_{G_m} \phi - H(G_m \,|\, \mu),$$

the last equality stemming from the definition of the $G_m$, according to which

$$H(G_m \,|\, \mu) = m E_{G_m} \phi - \log Z(m).$$

An elementary computation yields

$$m\left(\psi(G_m) - \psi(\nu)\right) = H(\nu \,|\, G_m) + (1 - m)\left[H(G_m \,|\, \mu) - H(\nu \,|\, \mu)\right].$$

This is true for any $m$, but we apply it now to $m = m_*$. Then either $1 - m_* = 0$, or
$H(G_{m_*} \,|\, \mu) = \log 2$. Therefore, if $H(\nu \,|\, \mu) \le \log 2$, then the right hand side above is
non-negative, implying that $\psi(\nu) \le \psi(G_{m_*})$, and equality only if $H(\nu \,|\, G_{m_*}) = 0$, i.e.
$\nu = G_{m_*}$. This proves the claim. ■
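The entropy identity used in this proof, $m(\psi(G_m) - \psi(\nu)) = H(\nu \,|\, G_m) + (1 - m)[H(G_m \,|\, \mu) - H(\nu \,|\, \mu)]$, is purely algebraic and can be checked on a finite state space. The four-point measures and the function $\phi$ below are arbitrary placeholder values, and the helper names are hypothetical.

```python
import math

def relative_entropy(nu, mu):
    """H(nu | mu) = sum_x nu(x) log(nu(x) / mu(x)) on a finite state space."""
    return sum(p * math.log(p / q) for p, q in zip(nu, mu) if p > 0.0)

def tilted_measure(mu, phi, m):
    """G_m with density e^{m phi} / Z(m) with respect to mu."""
    weights = [q * math.exp(m * f) for q, f in zip(mu, phi)]
    z = sum(weights)
    return [w / z for w in weights]

def psi(nu, mu, phi):
    """psi(nu) = E_nu(phi) - H(nu | mu)."""
    return sum(p * f for p, f in zip(nu, phi)) - relative_entropy(nu, mu)

# Placeholder four-point example
mu = [0.1, 0.2, 0.3, 0.4]
phi = [1.0, -0.5, 2.0, 0.3]
nu = [0.25, 0.25, 0.25, 0.25]
m = 0.6
g_m = tilted_measure(mu, phi, m)
lhs = m * (psi(g_m, mu, phi) - psi(nu, mu, phi))
rhs = relative_entropy(nu, g_m) + (1.0 - m) * (relative_entropy(g_m, mu) - relative_entropy(nu, mu))
print(lhs, rhs)
```

Both sides agree up to rounding for any choice of $m$, $\nu$ and $\phi$ with full support, in line with the remark that the identity holds for any $m$.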

Proof of Theorem 2.3. To prove the equivalence of Parisi and Gibbs Variational
Principles, we consider the function $\Psi(m) \stackrel{\text{def}}{=} \frac{\log 2}{m} + \frac{1}{m}\log E_\mu[e^{m\phi}] - \log 2$. Recall that the
Parisi variational principle amounts to minimizing $\Psi(m)$ over $[0, 1]$. Clearly, the minimum is
attained either at the boundary value $m = 1$, or at a solution $m$ of $\Psi'(m) = 0$. In
case the optimal value is attained at $m = 1$, one immediately sees that $\inf_{m \in [0,1]} \Psi(m) =
\Psi(1) = \Gamma_\phi(1)$, thus exactly as in the high-temperature case of Theorem 2.2. Otherwise,
it is crucial to remark that

$$\Psi'(m) = \frac{1}{m^2}\left\{H(G_m \,|\, \mu) - \log 2\right\}.$$

Therefore, $\Psi'(m) = 0$ iff $H(G_m \,|\, \mu) = \log 2$. By Theorem 2.2 we already know that the
generalized Gibbs measure associated to the solution of the latter equation is optimal. It
is also a simple algebraic exercise to check that with $m_*$ such that $\Psi'(m_*) = 0$ one also
has $\Psi(m_*) = \Gamma_\phi'(m_*) - \log 2$, showing the equivalence of Parisi and Gibbs variational
principle in the low temperature regime as well. ■

Remark 4.2

The above considerations also show that the order parameter of the Parisi formulation,
the optimal $m_*$ (with either $m_* = 1$ or such that $\Psi'(m_*) = 0$), is in fact the inverse of
temperature of the generalized Gibbs measure solving the Gibbs variational principle.
Moreover, derivatives of the target function $\Psi$ in the Parisi formulation naturally appear
in terms of entropies of the generalized Gibbs measures relative to the underlying random
media.

4.2 The Gibbs measure of spin glasses of REM-type

Let $m_*$ be as defined in Theorem 2.2 and $G = G_{m_*}$ the associated measure. We write
$v^2 := \mathrm{var}_G(\phi)$ for the variance of $\phi$ under $G$. Then define

$$a_N \stackrel{\text{def}}{=} E_G(\phi) N + \omega(N), \tag{4.7}$$

where $\omega(N) = -\frac{1}{m_*} \log\sqrt{2\pi v^2 N}$. For $\alpha \in \{1, \dots, 2^N\}$ let us also abbreviate $H_N(\alpha) =
\sum_{i=1}^{N} \phi(X_{\alpha,i})$.

We begin with a technical result:

Lemma 4.3

Assume that the measure $\mu\phi^{-1}$ on $\mathbb{R}$ is non-lattice. Then, with the above notations, for
any $t \in \mathbb{R}$

$$\lim_{N\to\infty} 2^N\, \mathbb{P}\Big[\sum_{i=1}^{N} \phi(X_{1,i}) - a_N > t\Big] = \frac{1}{m_*} e^{-m_* t}. \tag{4.8}$$
Proof. We use the usual transformation of measure argument, writing the probability
on the left-hand side of (4.8) in terms of a new sequence $\hat X_i$ of independent
random variables with distribution $G$. As $G$ is equivalent to $\mu$, also $\phi(\hat X_i)$ is
non-lattice. We write $G_N$ for the distribution of

$$\sum_{i=1}^{N} \left\{\phi(\hat X_i) - E_G(\phi)\right\}$$

and $\hat G_N$ for the standardized one: $\hat G_N(\cdot) = G_N(v\sqrt{N}\, \cdot)$.
By change of measure and integration by parts, it holds:

$$\mathbb{P}\Big[\sum_{i=1}^{N} \phi(X_{1,i}) - a_N > t\Big] = \exp\left[N\left(\log E_\mu[e^{m_*\phi}] - m_* E_G(\phi)\right)\right] \int_{t+\omega(N)}^{\infty} e^{-m_* y}\, G_N(dy).$$

Recall that in low temperature,

$$\log E_\mu(e^{m_*\phi}) - m_* E_G(\phi) = -\log 2,$$

and therefore

$$2^N\, \mathbb{P}\Big[\sum_{i=1}^{N} \phi(X_{1,i}) - a_N > t\Big] = \int_{t+\omega(N)}^{\infty} e^{-m_* y}\, G_N(dy)$$
$$= \int_{t+\omega(N)}^{\infty} \int_{m_* y}^{\infty} e^{-z}\, dz\, G_N(dy)$$
$$= \int_{m_*(t+\omega(N))}^{\infty} dz\, e^{-z} \Big[\hat G_N\Big(\Big(-\infty, \frac{z}{\sqrt{N} v m_*}\Big]\Big) - \hat G_N\Big(\Big(-\infty, \frac{t + \omega(N)}{\sqrt{N} v}\Big]\Big)\Big].$$

We use now the assumption that $\phi(X_i)$ has a non-lattice distribution. This implies
that we can approximate the distribution function of $\hat G_N$ by a smooth distribution
function, up to an error of order $o(N^{-1/2})$. More precisely, if we define GaussDF to be
the distribution function of the standard normal distribution, and $\varphi$ the density, then,
uniformly in $x$,

$$\hat G_N((-\infty, x]) = \text{GaussDF}(x) - \frac{\mu_3}{6 v^3 \sqrt{N}} (x^2 - 1) \varphi(x) + o\left(N^{-1/2}\right), \tag{4.9}$$

where $\mu_3$ is the third moment of $\phi(\hat X_i) - E_G(\phi)$ (see e.g. [6], Theorem XVI.4.1). If we
replace $\hat G_N$ above by the standard normal distribution, and transform back, we get an
expression

$$\int_{t+\omega(N)}^{\infty} e^{-m_* y} \exp\Big[-\frac{y^2}{2 N v^2}\Big] \frac{dy}{\sqrt{2\pi N v^2}},$$

which converges to $\frac{1}{m_*} e^{-m_* t}$ as $N \to \infty$ by the choice of $\omega(N)$.
In order to prove the lemma, it therefore suffices to prove that the two error summands
in (4.9) contribute nothing in the $N \to \infty$ limit. This is evident for $o(N^{-1/2})$ as

$$\int_{m_*(t+\omega(N))}^{\infty} dz\, e^{-z} = O\left(\sqrt{N}\right).$$

For the middle Edgeworth term in (4.9), the special form is of no importance, and we
only use that it is of the form $h(x)/\sqrt{N}$ with a bounded smooth function $h$:

$$\frac{1}{\sqrt{N}} \int_{m_*(t+\omega(N))}^{\infty} dz\, e^{-z} \Big[h\Big(\frac{z}{\sqrt{N} v m_*}\Big) - h\Big(\frac{t + \omega(N)}{\sqrt{N} v}\Big)\Big] = o(1).$$

■

Proposition 4.4

Within the above setting:

a) The point process $\sum_\alpha \delta_{H_N(\alpha) - a_N}$ converges weakly to a Poisson point process with
intensity measure $e^{-m_* t}\, dt$.

b) The point process $\sum_\alpha \delta_{\exp[H_N(\alpha) - a_N]}$ converges weakly to a PP($m_*$).

Proof. b) is evident from a). For the first claim, we shall exploit the equivalence
of weak convergence and convergence of Laplace functionals. For a continuous function
with compact support $F \in C_c(\mathbb{R})$ we have:

$$E \exp\Big[-\sum_\alpha F(H_N(\alpha) - a_N)\Big] = \left\{E e^{-F(H_N(1) - a_N)}\right\}^{2^N} = \left\{1 - E\left[1 - e^{-F(H_N(1) - a_N)}\right]\right\}^{2^N}.$$

By Lemma 4.3 this converges to $\exp\left[-\int \left(1 - e^{-F(t)}\right) e^{-m_* t}\, dt\right]$, settling a). ■

Since

$$\mathcal{G}_{\phi,N}(\alpha) = \frac{\exp\left[H_N(\alpha) - a_N\right]}{\sum_{\alpha'} \exp\left[H_N(\alpha') - a_N\right]},$$

to prove Theorem 2.4 it suffices to prove that in low temperature the normalization
commutes with taking the $N \to \infty$ limit. For this, we have the following:

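Lemma 4.5

Suppose $\phi$ is such that the system is in low temperature, and let $\varepsilon > 0$. There exists
$C > 0$ such that

$$\mathbb{P}\Big[\sum_\alpha \exp\left[H_N(\alpha) - a_N\right] 1_{H_N(\alpha) - a_N \notin [-C, C]} \ge \varepsilon\Big] \le \varepsilon$$

for large enough $N$.

Proof. First, by Lemma 4.3 we clearly have that

$$\mathbb{P}\Big[\sum_\alpha e^{H_N(\alpha) - a_N} 1_{H_N(\alpha) - a_N > C} \ge \varepsilon\Big] \le \mathbb{P}\left[\exists \alpha \in \Sigma_N : H_N(\alpha) - a_N > C\right]$$
$$\le 2^N\, \mathbb{P}\left[H_N(1) - a_N > C\right] \le \text{const} \times e^{-m_* C}$$

for large enough $N$, which can be made arbitrarily small by choosing $C$ large enough.
So, it remains to prove that we can find $C$ such that

$$\mathbb{P}\Big[\sum_\alpha e^{H_N(\alpha) - a_N} 1_{H_N(\alpha) - a_N < -C} \ge \varepsilon\Big] \le \frac{\varepsilon}{2}.$$

To see the last inequality, remark that the left hand side is bounded by

$$\frac{1}{\varepsilon} E\Big[\sum_\alpha e^{H_N(\alpha) - a_N} 1_{H_N(\alpha) - a_N < -C}\Big] = \frac{2^N}{\varepsilon} E\left(e^{H_N(1) - a_N} 1_{H_N(1) - a_N < -C}\right).$$

We proceed along the lines of Lemma 4.3:

$$2^N E\left(e^{H_N(1) - a_N} 1_{H_N(1) - a_N < -C}\right) = 2^N e^{N\left(\log E_\mu[\exp m_*\phi] - m_* E_G(\phi)\right)} e^{-\omega(N)} \int_{-\infty}^{-C+\omega(N)} e^{(1 - m_*) y}\, G_N(dy)$$
$$= e^{-\omega(N)} \int_{-\infty}^{-C+\omega(N)} e^{(1 - m_*) y}\, G_N(dy).$$

We again use (4.9). Important is now only that $G_N$, up to a signed measure $R_N$ of
total variation $o(N^{-1/2})$, is given by a (signed) measure $h_N(y/\sqrt{N})\, dy/\sqrt{N}$, where
$h_N$ is bounded, uniformly in $N$. Using that $m_* < 1$, we get for the smooth part

$$e^{-\omega(N)} \int_{-\infty}^{-C+\omega(N)} e^{(1 - m_*) y}\, h_N\left(y/\sqrt{N}\right) \frac{dy}{\sqrt{N}} \le \text{const} \times \frac{e^{-\omega(N)}}{\sqrt{N}} \int_{-\infty}^{-C+\omega(N)} e^{(1 - m_*) y}\, dy$$
$$= \text{const} \times e^{-C(1 - m_*)}\, \frac{e^{-m_* \omega(N)}}{(1 - m_*)\sqrt{N}}$$
$$= \text{const} \times e^{-C(1 - m_*)}\, \frac{\sqrt{2\pi v^2}}{1 - m_*}.$$

Taking $C$ large, we can make that arbitrarily small. For the $R_N$-part, we have by partial
integration and Fubini:

$$e^{-\omega(N)} \int_{-\infty}^{-C+\omega(N)} e^{(1 - m_*) y}\, R_N(dy) = e^{-\omega(N)} \int_{-\infty}^{-C+\omega(N)} R_N(dy) \int_{-\infty}^{(1 - m_*) y} dz\, e^{z}$$
$$= e^{-\omega(N)} \int_{-\infty}^{(1 - m_*)(-C+\omega(N))} dz\, e^{z} \times R_N\Big(\Big(\frac{z}{1 - m_*}, -C + \omega(N)\Big]\Big)$$
$$= o\left(N^{-1/2}\right) e^{-\omega(N)} \exp\left[(1 - m_*)(-C + \omega(N))\right]$$
$$= o(1).$$

■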
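Proof of Theorem 2.4. We denote by $\mathcal{M}$ the space of Radon measures on $(0, \infty)$
endowed with the vague topology. By $\mathcal{H}_N$ we denote the point process associated to the
collection of points $(\exp[H_N(\alpha) - a_N], \alpha \in \Sigma_N)$, and let $\mathcal{H}$ be its weak limit. We choose a
continuous function $h : \mathbb{R}^+ \to \mathbb{R}^+$ with $h(x) = x$ for $x \in [1/C, C]$, $h(x) \le x$ $\forall x$ and
$h(x) = 0$ for $x \notin [1/2C, 2C]$. Then $\int h\, d\mathcal{H}_N$ converges weakly to $\int h\, d\mathcal{H}$ by continuity of
the mapping $\mathcal{M} \ni \Xi \mapsto \int h\, d\Xi$. By Lemma 4.5, to $\varepsilon > 0$ we can find $C > 0$ large enough
such that

$$\mathbb{P}\Big[\int_0^{1/C} x\, d\mathcal{H}_N + \int_C^{\infty} x\, d\mathcal{H}_N \ge \varepsilon\Big] \le \varepsilon,$$

uniformly in $N$, from which we see by approximation that $\int_0^\infty x\, d\mathcal{H}_N$ converges weakly to
$\int_0^\infty x\, d\mathcal{H}$. This also implies that $(\mathcal{H}_N, \int x\, d\mathcal{H}_N)$ converges weakly towards $(\mathcal{H}, \int_0^\infty x\, d\mathcal{H})$.
Theorem 2.4 then clearly follows from the continuity of the mapping $\mathcal{M} \times (0, \infty) \to \mathcal{M}$
defined through $(\Xi, a) \mapsto \Xi\theta_a^{-1}$ with $\theta_a : (0, \infty) \to (0, \infty)$ and $\theta_a(x) \stackrel{\text{def}}{=} x/a$. ■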



4.3 The free energy of the REM+Cavity 

Proof of Proposition 3.1. Performing the trace over the Ising spins we have:

$$2^{-2N} \sum_{\alpha,\sigma} \exp\left(\beta H(\alpha, \sigma)\right) = 2^{-N} \sum_\alpha \exp\Big[\beta X_\alpha + \sum_{i=1}^{N} \log\cosh(\beta g_{\alpha,i})\Big], \tag{4.10}$$

and we write the $X_\alpha$'s as the sum of $N$ independent standard Gaussians $X_{\alpha,i}$, i.e.
$X_\alpha = \sum_{i=1}^{N} X_{\alpha,i}$. Thus Theorems 2.2 and 2.3 apply to the function $\phi(x_1, x_2) \stackrel{\text{def}}{=} \beta x_1 +
\log\cosh(\beta x_2)$ and $\mu(dx_1, dx_2)$ a standard, bivariate Gaussian.

The high temperature region $\Gamma_\phi'(1) - \Gamma_\phi(1) \le \log 2$ is equivalent to

$$E\left[\cosh(\beta g) \log\cosh(\beta g)\right] \le e^{\beta^2/2} \log 2,$$

$g$ a standard normal, and $E$ here the expectation with respect to $g$. So we have to identify
this region. We prove that there is a unique $\beta_{cr} > 0$ such that this inequality holds if
and only if $\beta \le \beta_{cr}$. To prove that, consider

$$H(\beta) = e^{-\beta^2/2} E\left[\log\cosh(\beta g) \cosh(\beta g)\right].$$

Then

$$\frac{dH}{d\beta} = -\beta e^{-\beta^2/2} E\left[\cosh(\beta g) \log\cosh(\beta g)\right] + e^{-\beta^2/2} E\left[g \sinh(\beta g)\right] + e^{-\beta^2/2} E\left[g \log\cosh(\beta g) \sinh(\beta g)\right]$$
$$= \beta e^{-\beta^2/2} \left\{E\left[\cosh(\beta g)\right] + E\left[\sinh^2(\beta g) \cosh(\beta g)^{-1}\right]\right\},$$

by Gaussian partial integration. So the derivative of $H$ is positive for $\beta > 0$. It is easy to see that
$H(0) = 0$ and $\lim_{\beta\to\infty} H(\beta) = \infty$. So it follows that there is a unique $\beta_{cr} > 0$ such that $H(\beta_{cr}) = \log 2$,
and $H(\beta) \le \log 2$ if and only if $\beta \le \beta_{cr}$.

If $\beta \le \beta_{cr}$, then $f(\beta) = \log E_\mu e^{\phi} = \beta^2/2 + \log E[\cosh(\beta g)] = \beta^2$. If $\beta > \beta_{cr}$, then we have to determine $m_*$
according to (2.7) which gives (3.5), and plug it into (2.8) which gives the expression for
the free energy (3.3). ■
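The quantities appearing in this proof can be evaluated numerically. The sketch below computes the critical value from $H(\beta_{cr}) = \log 2$ and then the free energy on both sides of it, using $\Gamma_\phi(m) = \beta^2 m^2/2 + \log E[\cosh(\beta g)^m]$ for the function $\phi$ of the proof. The Riemann-sum quadrature, the bisection brackets and all function names are assumptions made for illustration only.

```python
import math

LOG2 = math.log(2.0)

def gauss_mean(func, half_width=10.0, steps=4000):
    """E[func(g)], g ~ N(0, 1), via a Riemann sum on a uniform grid (endpoints are negligible)."""
    h = 2.0 * half_width / steps
    xs = (-half_width + k * h for k in range(steps + 1))
    total = sum(func(x) * math.exp(-0.5 * x * x) for x in xs)
    return total * h / math.sqrt(2.0 * math.pi)

def bisect(func, lo, hi, tol=1e-10):
    """Root of an increasing function with func(lo) < 0 < func(hi)."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        lo, hi = (mid, hi) if func(mid) < 0.0 else (lo, mid)
    return 0.5 * (lo + hi)

def h_function(beta):
    """H(beta) = e^{-beta^2/2} E[cosh(beta g) log cosh(beta g)]."""
    return math.exp(-0.5 * beta ** 2) * gauss_mean(
        lambda x: math.cosh(beta * x) * math.log(math.cosh(beta * x)))

def tilted_log_mean(beta, m):
    """Return (E[cosh^m log cosh] / E[cosh^m], E[cosh^m]) for cosh = cosh(beta g)."""
    norm = gauss_mean(lambda x: math.cosh(beta * x) ** m)
    num = gauss_mean(lambda x: math.cosh(beta * x) ** m * math.log(math.cosh(beta * x)))
    return num / norm, norm

def entropy_of_tilt(beta, m):
    """m Gamma'(m) - Gamma(m) for phi(x1, x2) = beta x1 + log cosh(beta x2)."""
    mean_log, norm = tilted_log_mean(beta, m)
    return 0.5 * (beta * m) ** 2 + m * mean_log - math.log(norm)

def free_energy(beta, beta_cr):
    if beta <= beta_cr:
        return beta ** 2                      # high temperature: Gamma(1)
    m = bisect(lambda t: entropy_of_tilt(beta, t) - LOG2, 1e-6, 1.0)
    mean_log, _ = tilted_log_mean(beta, m)
    return beta ** 2 * m + mean_log - LOG2    # low temperature: Gamma'(m_*) - log 2

beta_cr = bisect(lambda b: h_function(b) - LOG2, 0.1, 3.0)
print(beta_cr, free_energy(0.5, beta_cr), free_energy(2.0, beta_cr))
```

The low temperature branch evaluates $\Gamma_\phi'(m_*) - \log 2$, which equals the Parisi value by the algebraic identity noted in the proof of Theorem 2.3; the quadrature window limits the sketch to moderate values of $\beta$.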



4.4 The Gibbs measure of the REM+Cavity 



Proof of Proposition 3.2. Performing the trace over the Ising spins, the Gibbs weight
of the pure state $\alpha \in \{1, \dots, 2^N\}$ in the REM+Cavity reads

$$\mathcal{G}^{(1)}_{\beta,N}(\alpha) = \frac{\exp\Big[\beta X_\alpha + \sum_{i=1}^{N} \log\cosh(\beta g_{\alpha,i})\Big]}{\sum_{\alpha'} \exp\Big[\beta X_{\alpha'} + \sum_{i=1}^{N} \log\cosh(\beta g_{\alpha',i})\Big]}.$$

As in the proof of Proposition 3.1 we may then replace the $X_\alpha$ with $\sum_{i=1}^{N} X_{\alpha,i}$ for
a double sequence of independent standard Gaussians $X_{\alpha,i}$, and then we apply Theorem
2.4. ■

For the proof of the other results, we need some remarkable properties of the point 
processes PP (m). 

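Lemma 4.6

Assume that $\{v_\alpha\}_{\alpha \in \mathbb{N}}$ are the points of a PP($m_*$). Consider also, independently of this
point process, a sequence $\{(U_\alpha, V_\alpha)\}_{\alpha \in \mathbb{N}}$ of i.i.d. two dimensional square integrable
random vectors satisfying $V_\alpha \ge 1$. Then the following formulas hold:

$$E\Big[\frac{\sum_\alpha v_\alpha U_\alpha}{\sum_\alpha v_\alpha V_\alpha}\Big] = \frac{E\left[U V^{m_* - 1}\right]}{E\left[V^{m_*}\right]}, \tag{4.11}$$

$$E\Big[\frac{\sum_{\alpha \ne \alpha'} v_\alpha v_{\alpha'} U_\alpha U_{\alpha'}}{\left(\sum_\alpha v_\alpha V_\alpha\right)^2}\Big] = m_* \Big(\frac{E\left[U V^{m_* - 1}\right]}{E\left[V^{m_*}\right]}\Big)^2, \tag{4.12}$$

$$E\Big[\frac{\sum_\alpha v_\alpha^2 U_\alpha^2}{\left(\sum_\alpha v_\alpha V_\alpha\right)^2}\Big] = (1 - m_*) \frac{E\left[U^2 V^{m_* - 2}\right]}{E\left[V^{m_*}\right]}. \tag{4.13}$$

For a proof, see [12, Theorem 6.4.5].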

Proof of Proposition 3.3. We fix some notation. Let

\[
w_\alpha^{(1,2)} \stackrel{\text{def}}{=} \exp\left[\beta X_\alpha + \sum_{i=3}^N \log\cosh(\beta g_{\alpha,i}) - a_N\right]
\]

stand for the (not normalized) Boltzmann weight of the pure state $\alpha$ with a cavity in the
sites $i = 1,2$, the "centering constant" $a_N$ being given by (4.7) specialized to the present
setting. Remark that these weights somewhat differ from the original ones without the
cavity, but we clearly still have, in total analogy with Proposition 4.4, weak convergence
of $(w_\alpha^{(1,2)})$ towards a collection $(v_\alpha)$ distributed according to a PP$(m^*)$.

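For the proof of claim (3.9), expanding the quadratic terms, by symmetry and obvious
bounds we have:

\[
E\left\langle \left(q(\alpha,\alpha') - q^*\right)^2 \delta_{\alpha=\alpha'} \right\rangle^{\otimes 2}_{\beta,N} = A_1 - A_2 + A_3, \quad \text{say,}
\]

for the three terms of the expanded square. In what follows we write $\sim$, meaning that the quotient converges to 1, as $N \to \infty$.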
The point process $(G_{\beta,N}(\alpha))_\alpha$ converges to a PD$(m^*)$ which implies

\[
E\left[\sum_\alpha G_{\beta,N}(\alpha)^2\right] \sim 1 - m^*. \tag{4.14}
\]



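For $A_1$, we get

\[
\begin{aligned}
A_1 &= E\left[\frac{\sum_\alpha \left\{\sinh(\beta g_{\alpha,1})\sinh(\beta g_{\alpha,2})\right\}^2 \left(w_\alpha^{(1,2)}\right)^2}{\left(\sum_\alpha \cosh(\beta g_{\alpha,1})\cosh(\beta g_{\alpha,2})\, w_\alpha^{(1,2)}\right)^2}\right] \\
&= E\left[\frac{\sum_\alpha \left\{\sinh(\beta g_{\alpha,1})\sinh(\beta g_{\alpha,2})\right\}^2 \left(\bar w_\alpha^{(1,2)}\right)^2}{\left(\sum_\alpha \cosh(\beta g_{\alpha,1})\cosh(\beta g_{\alpha,2})\, \bar w_\alpha^{(1,2)}\right)^2}\right],
\end{aligned}
\]

where $\bar w_\alpha^{(1,2)} \stackrel{\text{def}}{=} w_\alpha^{(1,2)} / \sum_{\alpha'} w_{\alpha'}^{(1,2)}$. The corresponding point process converges weakly
to PD$(m^*)$, and taking $U_\alpha = \sinh(\beta g_{\alpha,1})\sinh(\beta g_{\alpha,2})$, $V_\alpha = \cosh(\beta g_{\alpha,1})\cosh(\beta g_{\alpha,2})$, a
simple domination argument shows that one can pass to the $N \to \infty$ limit, replacing
the $\bar w_\alpha^{(1,2)}$ by the points of this point process. Applying then (4.13), one gets

\[
A_1 \sim (1 - m^*)\,\frac{E\left[\tanh^2(\beta g_1)\tanh^2(\beta g_2)\cosh(\beta g_1)^{m^*}\cosh(\beta g_2)^{m^*}\right]}{E\left[\cosh(\beta g_1)^{m^*}\cosh(\beta g_2)^{m^*}\right]} = (1 - m^*)\,(q^*)^2.
\]

In a similar way, one proves

\[
A_2 \sim 2(1 - m^*)\,(q^*)^2.
\]

This proves (3.9). (3.10) follows similarly. ■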
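The bookkeeping behind the last step can be spelled out as follows. This is a sketch under two assumptions that the surrounding text suggests but does not display here: that $q^* = E[\tanh^2(\beta g)\cosh(\beta g)^{m^*}] / E[\cosh(\beta g)^{m^*}]$, and that $A_3$ is $(q^*)^2$ times the expectation appearing in (4.14). Since $g_1$ and $g_2$ are independent, the ratio obtained for $A_1$ factorizes,

\[
\frac{E\left[\tanh^2(\beta g_1)\tanh^2(\beta g_2)\cosh(\beta g_1)^{m^*}\cosh(\beta g_2)^{m^*}\right]}{E\left[\cosh(\beta g_1)^{m^*}\cosh(\beta g_2)^{m^*}\right]}
= \left(\frac{E\left[\tanh^2(\beta g)\cosh(\beta g)^{m^*}\right]}{E\left[\cosh(\beta g)^{m^*}\right]}\right)^2 = (q^*)^2,
\]

and the three limits cancel:

\[
\lim_{N\to\infty}\left(A_1 - A_2 + A_3\right) = (1-m^*)(q^*)^2 - 2(1-m^*)(q^*)^2 + (1-m^*)(q^*)^2 = 0.
\]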

Proof of Proposition 3.4. Consider for the moment the $\beta$-system only. Denote by
$f(\beta)$ the free energy and by $G_{m^*(\beta)}$ the associated extremal measure solving the GVP.

For $\varepsilon > 0$, set $B_{\beta,\varepsilon} \stackrel{\text{def}}{=} B_\varepsilon(G_{m^*(\beta)}) \subset \mathcal{M}_1^+(\mathbb{R}^2)$ for the open ball of radius $\varepsilon$ and center
$G_{m^*(\beta)}$. For $\alpha \in S_N$ we denote by $L_{N,\alpha}$ the empirical measures associated to the free
energies of the pure states.

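We first claim that given $\varepsilon > 0$ there exists $\delta > 0$ such that

\[
\mathbb{P}\left[ G_{\beta,N}\left(\alpha \in S_N : L_{N,\alpha} \notin B_{\beta,\varepsilon}\right) > e^{-\delta N} \right] \leq e^{-\delta N}. \tag{4.15}
\]

To see this, first observe that uniqueness of the maximizers solving the GVP implies
that, with $\phi(x_1,x_2) = \beta x_1 + \log\cosh(\beta x_2)$ and $\mu$ standard bivariate Gaussian,

\[
f(\beta,\varepsilon) = \sup\left\{E_\nu \phi - H(\nu \mid \mu) : H(\nu \mid \mu) \leq \log 2,\ \nu \notin B_{\beta,\varepsilon}\right\} < f(\beta). \tag{4.16}
\]

Using the same argument as in the proof of Theorem 2.1 we get

\[
\lim_{N\to\infty} \frac{1}{N} \log 2^{-N} \sum_{\alpha:\, L_{N,\alpha} \notin B_{\beta,\varepsilon}} e^{\beta X_\alpha + \sum_{i=1}^N \log\cosh(\beta g_{\alpha,i})} \leq f(\beta,\varepsilon).
\]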

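Using the variance estimate from (4.3) and the Tchebychev inequality, it is easily seen
that

\[
\mathbb{P}\left[A_N^c(\beta,\varepsilon)\right] \leq \exp(-\delta N), \tag{4.17}
\]

for some $\delta > 0$, where

\[
A_N(\beta,\varepsilon) \stackrel{\text{def}}{=} \left\{
\begin{aligned}
&\frac{1}{N}\log 2^{-N} \sum_{\alpha:\, L_{N,\alpha} \notin B_{\beta,\varepsilon}} e^{\beta X_\alpha + \sum_{i} \log\cosh(\beta g_{\alpha,i})} \leq f(\beta,\varepsilon) + \frac{\eta}{3}, \\
&\frac{1}{N}\log 2^{-N} \sum_{\alpha} e^{\beta X_\alpha + \sum_{i} \log\cosh(\beta g_{\alpha,i})} \geq f(\beta,\varepsilon) + \frac{2\eta}{3}
\end{aligned}
\right\},
\]

with $\eta \stackrel{\text{def}}{=} f(\beta) - f(\beta,\varepsilon)$. This clearly implies (4.15).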
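To make the last implication explicit (a short worked step, assuming the two conditions defining $A_N(\beta,\varepsilon)$ are the upper bound $f(\beta,\varepsilon) + \eta/3$ on the restricted sum and the lower bound $f(\beta,\varepsilon) + 2\eta/3$ on the full sum, both on the scale $\frac{1}{N}\log$): on the event $A_N(\beta,\varepsilon)$ the Gibbs mass of the complement of the ball is a ratio of these two sums, so

\[
G_{\beta,N}\left(\alpha \in S_N : L_{N,\alpha} \notin B_{\beta,\varepsilon}\right)
= \frac{2^{-N}\sum_{\alpha:\, L_{N,\alpha}\notin B_{\beta,\varepsilon}} e^{\beta X_\alpha + \sum_i \log\cosh(\beta g_{\alpha,i})}}{2^{-N}\sum_{\alpha} e^{\beta X_\alpha + \sum_i \log\cosh(\beta g_{\alpha,i})}}
\leq \frac{e^{N(f(\beta,\varepsilon) + \eta/3)}}{e^{N(f(\beta,\varepsilon) + 2\eta/3)}} = e^{-N\eta/3}.
\]

Hence, once $\delta$ is taken no larger than $\eta/3$, the event in (4.15) is contained in $A_N^c(\beta,\varepsilon)$, and (4.17) gives the claimed bound.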

If $\beta \neq \beta'$, then we can choose $\varepsilon > 0$ such that $B_{\beta,\varepsilon} \cap B_{\beta',\varepsilon} = \emptyset$, and the claim a)
follows.

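As for claim (3.12), we observe that

\[
E\left\langle \delta_{\alpha\neq\alpha'}\, q(\alpha,\alpha')^2 \right\rangle^{\otimes 2}_{\beta,\beta',N}
= E\left\langle q(\alpha,\alpha')^2 \right\rangle^{\otimes 2}_{\beta,\beta',N}
- E\left\langle \delta_{\alpha=\alpha'}\, q(\alpha,\alpha')^2 \right\rangle^{\otimes 2}_{\beta,\beta',N}. \tag{4.18}
\]

Since $q(\alpha,\alpha')^2 \leq 1$ for all $\alpha, \alpha'$, by claim a) of this Proposition the second term on the
right hand side of (4.18) vanishes in the limit $N \to \infty$. As for the first term on the
right hand side of (4.18), by symmetry and obvious bounds we have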

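\[
E\left\langle q(\alpha,\alpha')^2 \right\rangle^{\otimes 2}_{\beta,\beta',N} = E\left\langle \sigma_1\sigma_2\sigma'_1\sigma'_2 \right\rangle^{\otimes 2}_{\beta,\beta',N} + O(1/N). \tag{4.19}
\]

Let us now set

\[
w_\alpha^{(1,2,\beta)} \stackrel{\text{def}}{=} \exp\left[\beta X_\alpha + \sum_{i=3}^N \log\cosh(\beta g_{\alpha,i}) - a_N(\beta)\right]
\]

for the Boltzmann weight of the pure state $\alpha$ with a cavity in the sites $i = 1,2$ associated to the
$\beta$-system, $a_N(\beta)$ being the centering constant from (4.7) specialized to the setting.
Analogously, we write

\[
w_\alpha^{(1,2,\beta')} \stackrel{\text{def}}{=} \exp\left[\beta' X_\alpha + \sum_{i=3}^N \log\cosh(\beta' g_{\alpha,i}) - a_N(\beta')\right]
\]

in case of the $\beta'$-system. With this notation in mind, we write the expectation on the right hand
side of (4.19) as

\[
E\left[
\frac{\sum_\alpha \sinh(\beta g_{\alpha,1})\sinh(\beta g_{\alpha,2})\, w_\alpha^{(1,2,\beta)}}{\sum_\alpha \cosh(\beta g_{\alpha,1})\cosh(\beta g_{\alpha,2})\, w_\alpha^{(1,2,\beta)}}
\times
\frac{\sum_{\alpha'} \sinh(\beta' g_{\alpha',1})\sinh(\beta' g_{\alpha',2})\, w_{\alpha'}^{(1,2,\beta')}}{\sum_{\alpha'} \cosh(\beta' g_{\alpha',1})\cosh(\beta' g_{\alpha',2})\, w_{\alpha'}^{(1,2,\beta')}}
\right].
\]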

By Proposition 4.4 b) the point process associated to the collection of "points" of the $\beta$-system
$(w_\alpha^{(1,2,\beta)})$ converges weakly to a PP$(m^*(\beta))$, while the point process associated to
the $\beta'$-system converges to a PP$(m^*(\beta'))$. On the other hand, using similar arguments as
in the proof of claim a) it is not difficult to see that the limiting point processes are in fact
independent. [Given a compact set $K \subset \mathbb{R}_+$, the $\mathbb{P}$-probability to find a configuration
$\alpha \in S_N$ such that $w_\alpha^{(1,2,\beta)} \in K$ and simultaneously $w_\alpha^{(1,2,\beta')} \in K$ is exponentially small
in $N$.] Hence, the right hand side of (4.19) converges with $N \to \infty$ to the product

\[
E\left[\frac{\sum_\alpha \sinh(\beta g_{\alpha,1})\sinh(\beta g_{\alpha,2})\, w_\alpha}{\sum_\alpha \cosh(\beta g_{\alpha,1})\cosh(\beta g_{\alpha,2})\, w_\alpha}\right]
\times
E\left[\frac{\sum_\alpha \sinh(\beta' g_{\alpha,1})\sinh(\beta' g_{\alpha,2})\, w'_\alpha}{\sum_\alpha \cosh(\beta' g_{\alpha,1})\cosh(\beta' g_{\alpha,2})\, w'_\alpha}\right]
\]

with $(w_\alpha)$ a PP$(m^*(\beta))$, and $(w'_\alpha)$ a PP$(m^*(\beta'))$. By (4.11) both expectations are
seen to be equal to zero. This settles claim (3.12) of Proposition 3.4.
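The vanishing of the two expectations can be made explicit by a parity argument (a short worked step added for the reader, using (4.11) in the form $E[\sum_\alpha v_\alpha U_\alpha / \sum_\alpha v_\alpha V_\alpha] = E[UV^{m-1}]/E[V^{m}]$). With $U = \sinh(\beta g_1)\sinh(\beta g_2)$ and $V = \cosh(\beta g_1)\cosh(\beta g_2)$, the reflection $g_1 \mapsto -g_1$ preserves the law of $(g_1, g_2)$ and the value of $V$, but changes the sign of $U$. Since $|U| \leq V$, the product $UV^{m-1}$ is integrable, and therefore

\[
E\left[UV^{m-1}\right] = -E\left[UV^{m-1}\right] = 0
\quad\Longrightarrow\quad
E\left[\frac{\sum_\alpha v_\alpha U_\alpha}{\sum_\alpha v_\alpha V_\alpha}\right] = 0,
\]

for $m = m^*(\beta)$, and in the same way for $m = m^*(\beta')$ with $\beta$ replaced by $\beta'$.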